All Questions
31 questions
2votes
0answers
666views
random_state on train_test_split() appears to have large effect in performance metrics?
To summarize the problem: I have a data set with ~1450 samples, 19 features and a binary outcome where classes are fairly balanced (0.51 to 0.49). I split the data into a train set and a test set ...
3votes
1answer
449views
How do I use wavelet transform for feature extraction correctly?
I'm trying to classify words based on EMG signals using a support vector machine as my model. My dataset includes 15 classes (words) with 230 repetitions and 1000 features each. I already merged all ...
2votes
1answer
2kviews
Imbalanced data set with Sample weighting - How to interpret the performance metrics?
Consider a binary classification scenario whereby the True class (5%) is severely outbalanced to the False class (95%). My data set contains numeric data. I am using SKLearn and trying some different ...
0votes
0answers
221views
Results of quadratic SVM in Matlab are different from the results obtained in Python
I am trying to replicate a quadratic SVM classifier from Matlab to Python, however I am having different results regarding the accuracy. In Matlab the accuracy is 0.8955 meanwhile in Python the ...
1vote
2answers
44kviews
ValueError: y should be a 1d array, got an array of shape (285, 30) instead [closed]
I am using this data set below and I am trying to find the support vector machine of the data set. Also I have my code and error below as well. http://scikit-learn.org/stable/modules/generated/sklearn....
0votes
3answers
1kviews
Why is the accuracy of a LinearSVC not the same as the SDGClassifier?
I'm fine tuning parameters for a linear support vector machine. There are multiple ways to do it, but I wanted to compare LinearSVC and SDGClassifier in terms of time. I expected the accuracy score to ...
0votes
1answer
247views
How to include validation set in the pipeline to tune parameters for an SVM?
I have a dataset already divided into train, test and validation set. How can I insert the validation in my pipeline? Code: ...
1vote
2answers
734views
Ways to increase recall in SVM
I am training an SVM on UCI's Bank Marketing Data Set, the bank additional-full.csv. As the data is skewed I am also interested in recall. I am getting accuracy of about 87.95% but my recall is around ...
1vote
0answers
188views
Do I have to wrap multiclass SVM in OneVsRestClassifier()?
I am using an SVM for mulitclass classification between 3 labels (1,0,-1). I thought this could simply be done by using SVC(decision_function_shape = 'ovr') in my ...
-1votes
1answer
205views
How to select the best features for Support Vector Classification
I have a feature set that contains approximately 2 dozen features of technical analysis indicators. My own domain knowledge tells me that some of these features are better than others for predicitive ...
1vote
1answer
483views
In an SVM, does a more negative/positive decision score mean that it is further from the seperating hyperplane?
For example, if I have a sample with a decision score of -6 and another with a score of -3, which sample is closer to the hyperplane? Also, does the probability of a sample belonging to a class ...
1vote
0answers
617views
How to create an roc plot and calculate AUC for an svm (that does not return probabilities)?
I have some SVM classifier outputting final classifications for every sample in the test set, something like 1, 1, 1, 1, 0, 0, 0, 1, 0, 0, 1, 1 and so on. The "...
0votes
2answers
128views
Confusion on result of K-Fold Cross Validation and Independent Test set
I am relatively new in Machine Learning. I am using Random Forest and SVM for a project. Where I did a ...
2votes
0answers
404views
Why are the regions/decision boundaries overlapping with multi-class classification using SVM in sci-kit?
I am using the SVM in scikit-learn library for doing multiclass classification. I am wondering why these regions (decision boundaries) are overlapping (as seen in the picture below)? Could someone ...
2votes
1answer
27views
When to question output of model
I'm unsure of how to ask a question without making it seem like a code review question. At what point does one question whether they've actually implemented the algorithm and-or model correctly? ...